Modeling of Web Robot Navigational Patterns

نویسندگان

  • Pang-Ning Tan
  • Vipin Kumar
چکیده

In recent years, it is becoming increasingly diÆcult to ignore the impact of Web robots on both commercial and institutional Web sites. Not only do Web robots consume valuable bandwidth and Web server resources, they are also making it more diÆcult to apply Web Mining techniques e ectively on the Web logs. E-commerce Web sites are also concern about unauthorized deployment of shopbots for the purpose of gathering business intelligence at their Web sites. Ethical robots can be easily detected because they tend to follow most of the guidelines proposed for robot designers. On the other hand, unethical robots are more diÆcult to identify since they may camou age their entries in the Web server logs. In this paper, we examine the problem of identifying Web robot sessions using standard classi cation techniques. Due to the temporal nature of the data, the classi cation model may vary depending on the number of requests made by the Web user or robot. Our goal is to determine the minimum number of requests needed to distinguish between robot and non-robot sessions, with reasonably high accuracy. Our preliminary results show that highly accurate models can be obtained after three requests using a small set of access features computed from the Web server logs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A density based clustering approach to distinguish between web robot and human requests to a web server

Today world's dependence on the Internet and the emerging of Web 2.0 applications is significantly increasing the requirement of web robots crawling the sites to support services and technologies. Regardless of the advantages of robots, they may occupy the bandwidth and reduce the performance of web servers. Despite a variety of researches, there is no accurate method for classifying huge data ...

متن کامل

Mission-Based Navigational Behaviour Modeling for Web Recommender Systems

Web recommender systems anticipate the information needs of on-line users and provide them with recommendations to facilitate and personalize their navigation. There are many approaches to building such systems. Among them, using web access logs to generate users’ navigational models capable of building a web recommender system is a popular approach, given its non-intrusiveness. However, using ...

متن کامل

Modeling user hidden navigational behavior for Web recommendation

Web users exhibit a variety of navigational interests through clicking a sequence of Web pages. Analyses of Web usage data will lead to discovering Web user access patterns, and in turn, facilitating users to locate more preferable Web contents via collaborative recommendation techniques. In the context of Web usage mining, Latent Semantic Analysis (LSA) based on probability inference provides ...

متن کامل

NaviMoz: Mining Navigational Patterns in Portal Catalogs

Portal Catalogs is a popular means of searching for information on the Web. They provide querying and browsing capabilities on data organized in a hierarchy, on a category/subcategory basis. This paper presents mining techniques on user navigational patterns in the hierarchies of portal catalogs. Specifically, we study and implement navigation retrieval methods and clustering tasks based on nav...

متن کامل

Improving Web Information Systems with Navigational Patterns

In this paper we show how to improve the architecture of Web Information Systems (WISs) using design patterns, in particular navigational patterns. We first present a framework to reason about the process of designing and implementing these applications. Then we introduce navigational patterns and show some prototypical patterns. We next show how these patterns have been used in some successful...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000